Design and optimization of DBSCAN Algorithm based on CUDA
نویسندگان
چکیده
DBSCAN is a very classic algorithm for data clustering, which is widely used in many fields. However, with the data scale growing much more bigger than before, the traditional serial algorithm can not meet the performance requirement. Recently, parallel computing based on CUDA has developed very fast and has great advantage on big data. This paper summarizes the algorithms proposed before and improves the performance of the old DBSCAN algorithm by CUDA. The algorithm uses shared memory as much as possible compared with other algorithms and it has very good scalability. A data set is tested on the algorithm of new version. Finally, we analyze the results and give a conclusion that our algorithm is approximately 97 times faster than the serial version.
منابع مشابه
An approach to Improve Particle Swarm Optimization Algorithm Using CUDA
The time consumption in solving computationally heavy problems has always been a concern for computer programmers. Due to simplicity of its implementation, the PSO (Particle Swarm Optimization) is a suitable meta-heuristic algorithm for solving computationally heavy problems. However, despite the simplicity, the algorithm is inefficient for solving real computationally heavy problems but the pr...
متن کاملبررسی مشکلات الگوریتم خوشه بندی DBSCAN و مروری بر بهبودهای ارائهشده برای آن
Clustering is an important knowledge discovery technique in the database. Density-based clustering algorithms are one of the main methods for clustering in data mining. These algorithms have some special features including being independent from the shape of the clusters, highly understandable and ease of use. DBSCAN is a base algorithm for density-based clustering algorithms. DBSCAN is able to...
متن کاملParallelization of Rich Models for Steganalysis of Digital Images using a CUDA-based Approach
There are several different methods to make an efficient strategy for steganalysis of digital images. A very powerful method in this area is rich model consisting of a large number of diverse sub-models in both spatial and transform domain that should be utilized. However, the extraction of a various types of features from an image is so time consuming in some steps, especially for training pha...
متن کاملImprovement of density-based clustering algorithm using modifying the density definitions and input parameter
Clustering is one of the main tasks in data mining, which means grouping similar samples. In general, there is a wide variety of clustering algorithms. One of these categories is density-based clustering. Various algorithms have been proposed for this method; one of the most widely used algorithms called DBSCAN. DBSCAN can identify clusters of different shapes in the dataset and automatically i...
متن کاملDesign Optimization of Axial Flux Surface Mounted Permanent Magnet Brushless DC Motor For Electrical Vehicle Based on Genetic Algorithm
This paper presents the design optimization of axial flux surface mounted Permanent Magnet Brushless DC motor based on genetic algorithm for an electrical vehicle application. The rating of the motor calculated form vehicle dynamics is 250 W, 150 rpm. The axial flux surface mounted Permanent Magnet Brushless DC (PMBLDC) motor was designed to fit in the rim of the wheel. There are several design...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1506.02226 شماره
صفحات -
تاریخ انتشار 2015